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Loop formation between monomers in the interior of semifiexible chains describes 
elementary events in biomolecular folding and DNA bending. We calculate ana- 
lytically the interior distance distribution function for semifiexible chains using a 
mean-field approach. Using the potential of mean force derived from the distance 
distribution function we present a simple expression for the kinetics of interior loop- 
ing by adopting Kramers theory. For the parameters, that are appropriate for DNA, 
the theoretical predictions in comparison to the case are in excellent agreement with 
explicit Brownian dynamics simulations of worm-like chain (WLC) model. The inte- 
rior looping times {tjc) can be greatly altered in cases when the stiffness of the loop 
differs from that of the dangling ends. If the dangling end is stiffer than the loop 
then Tjc increases for the case of the WLC with uniform persistence length. In con- 
trast, attachment of flexible dangling ends enhances rate of interior loop formation. 
The theory also shows that if the monomers are charged and interact via screened 
Coulomb potential then both the cyclization (tc) and interior looping {tjc) times 
greatly increase at low ionic concentration. Because both Tc and tjc are determined 
essentially by the effective persistence length (Ip ) we computed Ip by varying the 
range of the repulsive interaction between the monomers. For short range inter- 
actions Ip nearly coincides with the bare persistence length which is determined 
largely by the backbone chain connectivity. This finding rationalizes the efficacy 
of describing a number of experimental observations (response of biopolymers to 
force and cyclization kinetics) in biomolecules using WLC model with an effective 
persistence length. 
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I. INTRODUCTION 

The kinetics of formation of contact between the ends of a polymer chain has a rich history.i^ 
Both experiments,—"^ theory,i^'^'^*^'^^ and simulation o^'^i^^i^^ have been used to address the ele- 
mentary event of the dynamics of end-to-end contact formation (or cyclization kinetics) (Fig^ 
A). Contact formation between two reactive groups separated by a certain distance along the 
chain is a basic intramolecular rate process in a polymer. Recently, there has been renewed 
interest in understanding the looping dynamics that has been studied both theoretically^*^*^^ 
and experimentallyi^'i^'i^'i^^ because of its fundamental importance in a number of biological 
processes. The hairpin loop formation is the elementary step in RNA folding,Ji structure for- 
mation in ssDNAj' —' -- and protein folding .i^^'22i2L22iS^ Cyclization in DNA has recently drawn 
renewed attention not only because of its importance in gene expression^iiS^ but also it provides 
a way to assess DNA's flexibility. The promise of using single molecule technique to probe the 
real time dynamics of polymer chains has also spurred theories and simulations of cyclization 
kinetics. Using loop formation times between residues that are in the interior as the most ele- 
mentary event in protein folding, it has been argued, using experimental data and theoretical 
expression for probability for loop formation in stiff chains, that the speed limit for folding is 
on the order of a 1 /is.— These examples illustrate the need to understand quantitatively the 
elementary event of contact formation between segments of a polymer chain. 

Even without taking hydrodynamic interactions into account theoretical treatment of cy- 
clization kinetics in polymer chains is difficult because several relaxation times and length and 
energy scales are interwined. At the minimum the variation of time scale for cyclization (r^) 
with polymer length is dependent on polymer relaxation time (t/j). In biopolymers additional 
considerations due to chain stiffness and heterogeneity of interactions between monomer (amino 
acid residue or nucleotides) must be also taken into account. Majority of the cyclization kinetics 
studies on synthetic polymers^ have considered examples in which the contour length (L) of the 
polymer is much greater than its persistence length (Ip). In contrast, loop formation dynamics 
in biopolymers have focused on systems in which L/lp is relatively small. In disordered polypep- 
tide chains L/lp can be as small as 3 j"^-'- ' 7- while in DNA L/lp < ifLLS^ Thus, it is important to 
develop theoretical tools for the difficult problem of loop formation dynamics for arbitrary L 
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and Ip. Despite the inherent complexities in treating loop formation in biopolymers it has been 
found that the use of polymer-based approach is reasonable in analyzing experimental data on 
cyclization kinetics in proteins^'^-^ and DNA.— 

In this paper we are primarily concerned with the looping dynamics between interior segments 
of a semifiexible chain. While a lot of theoretical and experimental works (mentioned above) 
have been done on the end-to-end looping (Fig|T]-(a)), only a few studies have been reported 
on the contact formation between monomers in the interior of a chain (interior looping) (FigQ- 
^jj-j^ 29^30^31^32^33 ^j^gj^g are a few reasons to consider kinetics of interior looping. (1) The biological 
events such as hairpin formation and DNA looping often involve contact formation between 
monomers that are not at the ends of the chain. For example, it is thought that the initiation 
of nucleation in protein folding occurs at residues that are near the loop regions.— The residues 
that connect these loops are in the interior of the polypeptide chain. Similar processes are 
also relevant in RNA folding.^ (2) It is known that for flexible chains with excluded volume 
interactions (polymer in a good solvent) the probability of loop formation is strongly dependent 
on the location of the two segments. For large loop length (S) the loop formation probability, 
P{S), in three dimensions for chain ends ~ S^^ where 6i ^ 1.9 while P{S) ~ S^'^ with $2 ~ 2.1 
for monomer in the interior. Although the values of 9i and 62 are similar it could lead to 
measurable differences in loop formation times.— 

The rest of the paper is organized as follows. In section II we present the physical consider- 
ations that give rise to the well-established results for for flexible chains. The extension of 
the arguments for flexible chains to semifiexible polymers suggests that the local equilibrium 
approximation can be profitably used to analyze both cyclization kinetics and interior looping 
dynamics. The basic theory for the equilibrium distance distribution between two interior 
segments Si and S2 (Fig|T]-(b)) is presented in section III. Using the equilibrium distribution 
function and adopting Kramers theory and following the suggestion by Jun et. al.,^'^ we obtain 
an analytical expression for time scale tjc for interior contact formation in section IV. Explicit 
results of simulations of worm-like chain (WLC), which validate the theory, are presented in 
section V. In section VI we consider the kinetics of interior loop formation in WLC in which 
the stiffness of the loop is different from that of the dangling ends. Section VII describes 
the consequences of screened Coulomb interaction between monomer segments on cyclization 
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kinetics and interior looping dynamics. Because the results in section VIII are expressed in 
terms of a renormalized persistence length {Ip ) of WLC we present simulation results for Ip 
variation for a number of potentials that describe interactions between monomers in section 
VII. The conclusions of the article are summarized in section IX. 



II. PRELIMINARY CONSIDERATIONS 

The pioneering treatment of loop formation dynamics due to Wilemski and Fixman (WF)^*^ 
has formed the basis for treating cyclization kinetics in flexible polymer chains. Using 
a generalized diffusion equation for the probability density, ^^^^q^ = CFpP{{T^^},t) — 
A;iS({r^})P({r^}, t) {Cpp is a generalized diffusion operator, kS is a sink term) for a N-segment 
polymer, and local equilibrium approximation within the sink, WF expressed the cyclization time 
Tc in terms of an integral involving a sink-sink correlation function. From the WF formalism 
and related studies it is known that even in the simplest cases (ideal chains or polymers with 
excluded volume interactions) the validity of the local equilibrium approximation depends on 
the interplay between and the chain relaxation time, tr. If Tc ^ then the local equilibrium 
approximation is expected to hold because the polymer chain effectively explores the available 
volume before the monomers at the end (reactive groups) form a contact. In this situation, 
Tc can be computed by considering mutual diffusion of the chain ends in a potential of mean 
force {F{Re)). For ideal chains, F{Rc) = -kBTlogP{Re) ~ 3kBTRl/2R^ where Re is the 
end-to-end distance, R ~ aN^^"^ is the mean end-to-end distance, a is the size of the monomer, 
T is the temperature, and ks is the Boltzmann constant. By solving such an equation sub- 
ject to the absorbing boundary condition, Szabo, Schulten, and Schulten (SSS)^ showed that 
'Tsss = ToN'^^"^. Simulations^ and theory^ show that if the capture radius for contact formation 
is non-zero, and is on the order of a monomer size then Tc ~ ~ TiN"^^^^ where u = 1/2 
for Rouse chains and z/ ^ 3/5 for polymers with excluded volume, and D^. is a mutual diffusion 
coefficient. The use of these theories to analyze the dependence of Tc on N in polypeptides shows 
that the physics of cyclization kinetics is reasonably well described by diffusion in a potential 
of mean force F{Re) which only requires accurate calculation of P{Re) the end-to-end distri- 
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bution function . For describing interior looping times tjq for contact between two interior 
monomers Si and S2 we need to compute P{Ri2, \ si — S2I) where R12 is the distance between 
si and S2. With P{Ri2, |s2 — si|) in hand Tc can be computed by solving a suitable diffusion 
equation. 

Because the use of F{Re) in computing Tc and tjc is intimately related to chain relaxation 
times it is useful to survey the conditions which satisfy the local equilibrium approximation. By 
comparing the conformational space explored by the chain ends compared to the available volume 
prior to cyclization— the validity of the local equilibrium approximation in flexible chains can 
be expressed in terms of an exponent 9 = Here d is the spatial dimension, the correlation 
hole exponent (des Cloizeaux exponent)^ g describes the probability of the chain ends coming 
close together, and z is the dynamical scaling exponent (r/j ~ R^). If > 1 the local equilibrium 
approximation is expected to hold and is determined essentially by the equilibrium P{Re) as 
Re ^ a the capture radius. Using the scaling form of P{Re) for small Re P{Re) ~ ^ i^Y 
and R ~ N'^ (z/ is the Flory exponent) we find Te ~ N^^'^'^^\ For Gaussian chains z/ = 1/2 and 
5f = and hence tsss ^ Tc ^ N^^'^. This result was obtained fifty years ago by Jacobsen and 
Stockmayer.— However, in the free-draining case {z = 4, g = 0, v = 1/2, c? = 3), 9 < 1 and 
hence the condition Te ^ is not satisfied. In this case Te ^ N^" ~ A^^. Thus, for ideal 
Gaussian chains it is likely that tsss < '^c < Ty/pM Indeed, recent simulations show that if the 
number of statistical segments is large (> 20) then for ideal chains Te ~ A^^ which signals the 
breakdown of the condition Te ^ tr. Experiments on cyclization of polypeptide chains show that 
Te ~ N^^'^a is obeyed for in the range 10 < < 20 (see Fig. (5) in Ref.— ). Deviations from 
ideal chain results are found for A^ < 10, either due to chain stiffness^- or sequence variations.-^ 
For polymer chains in good solvents with hydrodynamic interactions {d = 3, g = 5/18, z = 3, 
and u = 3/5), 9 = 59/54 > 1. Thus, in real chains the local equilibrium approximation may be 
accurate. 

For stiff chains bending rigidity severely restricts the allowed conformations especially when 
the contour length (L) is on the order of the persistence length (/p). Because of high bending 
rigidity the available volume is restricted by thermal fluctuations. Clearly in this situation, 
the chain is close to equilibrium. This may be the case for short DNA segments. In effect 
these chains satisfy the Tc > tr condition which enables us to calculate Te or tjc by solving an 



6 



appropriate one dimensional diffusion equation (see below) in a suitable potential of mean force. 

Effect of chain Stiffness : Many biopolymers are intrinsically stiff and are better described by 
worm-like chain (WLC) models. The persistence length, which is a measure of stiffness, varies 
considerably. It ranges from (3 - 7) A (proteins), (10 - 25) A (ss-DNAi2-^ and RNA,i^) 50 
nm for ds-DNA. Typically, loops of only a few persistence length form, which underscores the 
importance of chain stiffness. In order to correctly estimate the loop closure time, consideration 
of the stiffness in the loop closure dynamics is necessary unless the polymer looping takes place 
between the reactive groups that are well separated and the chain length L is long. If L ^ /p 
(persistence length), the looping dynamics will follow the scaling law for flexible chains. However, 
at short length scales loop dynamics can be dominated by chain stiffness.-^ If the chain is stiff 
then WLC conformations are limited to those allowed by thermal fluctuations. In this situation, 
the time for exploring the chain conformations is expected to be less than r^. Thus, we expect 
local equilibrium to be a better approximation for WLC than for long flexible chains. 

Recently, Dua et. al.^'^ have studied the effect of stiffness on the polymer dynamics based 
on Wilemski-Fixman formalism and showed, that for free-draining semiflexible chain with- 
out excluded volume Tc ~ ]y2.2~2.4 moderate values of stiffness. However, the proce- 
dure used to obtain this result is not complete, as recognized by the authors, because they 
use a Gaussian propagator ^(r, tjr', 0) = (a^^py^n^) ' ^^P (^^&y^^^0(|y) which is not 
valid for WLC. The end-to-end distance distribution becomes a Gaussian at equilibrium, 
limt^oo ^(r, ^|r', 0) = Peq{^) = {^ 2idr^) ) ^-^P (~"^r^)' which is incorrect for semiflexible chain 
especially when Ip ^ L (see FigOand Refs .^^i"^^ ). 

As an alternative method we include the effect of chain stiffness assuming that local equilib- 
rium approximation is valid. This is tantamount to assuming that > t/j which, for reasons 
given above, may be an excellent approximation for WLC.^^ In this case we can compute Tc 
by solving the diffusion equation in a one dimensional potential F{Re) = —ksT In P{Re) where 
P{Re) is the probability of end-to-end distance distribution for WLC. For the problem of in- 
terest, namely, the computation of tjc, we generalize the approach of Jun et. al.^'^ who used 
Kramers theory in the effective potential F{R(.) to obtain r^. In general, the time for cyclization 
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can be calculated using 

r,= Td^e^^fe^i r dze-^^^^^ (1) 

J a J y 

where a is the capture (contact) radius of the two reactive groups. We show that Eq.([Q) provides 
accurate estimates of r^, thus suggesting the local equilibrium approximation is guaranteed. 

Here, we address the following specific questions: What is the loop formation time between 
the interior segments in a semiflexible chain? Does the dangling ends (Fign-(b)) affect the 
dynamics of loop formation? How does the effect of interaction between monomer segments 
(e.g. excluded volume, electrostatic interaction) affect loop closure kinetics in WLC models? 



III. DISTANCE DISTRIBUTION FUNCTION BETWEEN TWO INTERIOR 

POINTS 

A key ingredient in the calculation of the potential of mean force is appropriate distribution 
function between the two monomers that form a contact. In Refs.-^^iSi the equilibrium end-to-end 
(i?e) radial distribution function of a semiflexible chain P{Re) was obtained in terms of the per- 
sistence length (Ip) and the contour length (L) . Despite the mean field approximation employed 
in Refs .^^'^^ the distribution function P{Re) is in very good agreement with simulations.^^ The 
simplicity of the final expression has served as a basis for analyzing a number of experiments 
on proteins,— RNvM^ and DNA.— In this section, we use the same procedure to calculate the 
distribution function P{Ri2]lp, Si, S2, L) where < si, S2 < L, and R12 is the spatial distance 
between si and S2- 

For the semiflexible chain in equilibrium we write the distribution function of the distance 
R12 between si and S2 (FigH} along the chain contour as 

u{s)ds))MF 

^ I ^[u(^)]^(Ri2 - n{s)ds)^MFHs)] 

JD[m{s)]^mfHs)] ^ ^ 

where u(s) is a unit tangent vector at position s. The exact weight for the semiflexible chain is 
\l/[u(s)] oc exp [— ^ Jq ds (fj)^] n^l^^l-^) ~ The nonlinearity, that arises due to the restric- 
tion u^(s) = 1, makes the computation of the path integral in Eq.Q difficult. To circumvent 
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the problem we replace \&[u(s)] by the mean field weight '^mf[^{s)]^ 

^mfHs)] cx exp [-| \s-X l\n\s) - l)ds - S[{ul - 1) + (u^ - 1)]]. (3) 

The Lagrange multipliers A and 6, which are used to enforce the constraint u^(s) = 1,^^ will 
be determined using stationary phase approximation (see below). The path integral associated 
with the weight \E'mf[u(s)] is equivalent to a kicked quantum mechanical harmonic oscillator 
with "mass" Ip and angular frequency = y^2X/lp. Using the propagator for the harmonic 
oscillator 

3 

f7ismh{ns)Y^ ( ^ (u2 + u2) cosh - 2u, • uo 

and defining Vtp = ^ the isotropic distribution function becomes 

■^^-p j (iuo(iu^i(iu^2'^ULe"'^"oZ(uo,U5i; si) 



2y 2A' 

X e''<Zius„UL;L-S2). (5) 

By writing the distribution function as G{Ri2, Si, S2) = /-iL '^'^^xp (— JF[A, 6]) it is clear 

that the major contribution to G (in the thermodynamic limit L —>■ 00) comes from the saddle 
points of the free energy functional JF[A,5], i.e., fx = ^ = 0. The functional J^[A, 5] is (see 
Appendix A for details of the derivation) 

J^[X,S] = -(LA + 25) 

'sinhQL,^, ^9 . 3, A^ 



3, /sinhQL.^a ^r, , 3, 

+ - In — {6^ + + 25np cothl^L) - - In 



2 Q(si,s2;A,5) 

^ g(si,s2;A,5) ^ ^ 

To obtain the optimal values of A and 5 we first take the L — > 00 limit and then solve 
stationarity conditions fx = ^ = 0. Technically, the optimal value of b and A should be 
calculated for a given L and then it is proper to examine the L — 00 limit. The consequences of 
reversing the order of operation are discussed in Appendix B. Using the first procedure (taking 
L ^ 00 first) we obtain Q{si, S2; X,S) \s2 — si\X in the limit L ^ S2 ^ si ^ 1, and thus 
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^[A, S] becomes 



J^[\,6] ^ -(LA + 25) 



\S2 - Si 


^12 


L 


\S2 - Sl\ 


2 



2 'np'rip ' ' 2 P 2 A |s2 - sil 

2 ^ 

+ ^ln[^(^ + in + ^ln|.2-3i|-25 (7) 

where we have omitted numerical constants. The major contribution to the integral over A and 
5 comes from the sets of A and 5 which pass the saddle point of a stationary phase contour on 
the Re{jF} plane. Since the term linear in L dominates the logarithmic term in Ip even when 
L/lp ~ 0{1), the stationary condition for A can be found by taking the derivative with respect 
to A by considering only the leading term in L (cf. see Appendix B for details). The stationarity 
condition leads to 



4 1 _ Na-^il 2 
^ L ^ 



where r = |^^^^| with < r < 1. Similarly, the condition for 5 can be obtained as 

5 = \-^,. (9) 

Determination of the parameters A and 5 by the stationary phase approximation amounts to 
replacing the local constraint u^(s) = 1 by a global constraint (u^(s)) = 1.^* Finally, the 
stationary values of A and 5 in the large L limit give the interior distance distribution function: 

^(-^12,52 Si)-— ji^r^Tr^T^ <^^P ( ^77l£2^£T|T7| M2-siL2\^' ^^^^ 



The mean-field approximation allows us to obtain a simple expression for the internal segment 
distance distribution function. The previously computed P{Re)^ can be retrieved by setting 
152 — Si\ = L. The radial probability density, for the interior segments, in three dimensions, for 
semiflexible chains is 

P(r;s2- si,t) = AnC exp ( ) (11) 

^ ^ k2-'^il ^2^9/2 ^ 1^ 1^2-^11 ^^]^ \s2-si\ 2y ^ ' 
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where r = -R12/IS2 — si\ = R/\s2 — si\ and t = \s2 — si\/Iq with Iq = |/p. The normahzation 
constant C is determined using P{r, s; t)dr = 1. The integral is evaluated by the substitution 



= (^^-^p[-2v^Xoe-"(i+^o)(i5 + 2a(6 + 5x1 + 2a{l + xlY)) 

+ a^e~''^/^eTf[^/^xo]{l + 3a~^ + ^a-^)]-^ (12) 



where a = ^^7=^, = ^ iji^ji^) and erf (x) is the error function. The peak in the distribution 
function is at 

rmax = \ \ I J-- (13) 

V l\S2-Si\/L 

where ?7 = | - , \sl-.^\ ■ For \s2 - si| = L, r^ax — > as t 00 and r^ax 1 as t ^ 0. 

In Fig|2l we compare the distribution functions P{Re) and P{Ri2,S2 — si). When 
|si — S2\/L = 1 Eq. (|TT|l gives the end-to-end distribution for semifiexible chains. By adjusting 
the value of t (or equivalently Iq) we can go from flexible to intrinsically stiff chains. As the chain 
gets stiff there is a dramatic difference between the P(r; |si — S2\,t) and P(r; |si — S2I = L,t) 
(see Fig|21-(b)). Contact formation between interior segments are much less probable than 
cyclization process (compare the green and red curves with the black in Fig|21-(b)). Physically, 
this is because stiffness on shorter length scales (|si — S2|/i^ < 1) is more severe than when 
l-Si — S2\/L ~ However, when the chain is flexible (large t) the difference between the 

probability of contact between the interior segments and cyclization is small (Fig|21-(a)). In the 
limit of large t(oc L/lp) the Hamiltonian in Eq.Q describes a Gaussian chain for which the 
distance distribution between interior points remains a Gaussian. However, if excluded volume 
interactions are taken into account there can be substantial difference between P{x, \si — S2\,t) 
and end-to-end segment distribution even when t is moderately large. 
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IV. INTERIOR LOOP CLOSURE TIME USING KRAMERS THEORY 

Having obtained the effective potential between interior segments of semiflexible cliain we can 
evaluate Eq.dH) using F{R) = -ksTlogPiR) with P{R) given by Eq.^ with r = R/L. For 
clarity we have suppressed the dependence of P{R) on \s2 — The expression for the mean 
first passage time (Eq.(^) can be approximated by expanding the effective potential F{R) at 
the barrier top and at the the bottom as F{R) ^ F{Rt) — |V^F(i?)|R=/jj(i? — RtY + ■ ■ ■ and 

F{R) ^ F{Rb) + \V%F{R)\n=Ri,{R - Rbf H , respectively (FigEj). Evaluating the resulting 

Gaussian integrals yields the Kramers equation 

Tc ~ TKr = = exp (AF^/kBT). (14) 

D^\/lFiR)\n=R,V'^lFiR)\ 

R=Rt 

When evaluating the Gaussian integral at the barrier top with R = Rt (Fig|21), we assume that 
only the integral beyond R > Rt contributes to the result. In the overdamped limit the mean 
first passage time, which is roughly the inverse of the reaction rate, is determined by the barrier 
height {AF^ = F{Rt) — F{Rb)), and the curvatures of the bound state, the curvature at the 
barrier top, and the friction coefficient, that depends on D{= 2Dq) where Dq is the monomer 
diffusion coefficient. The curvatures of the potential at the bottom {Ra = Rb) or at the top 
= Rt) (Figia right panel) is obtained using v^VpWU^ = ^rT (tI - ) 
by imposing the condition VrF{R)\r=r^ = 0, i.e., + ^^^fy^) |h=Hc = 0. There is an 
uncertainty in the evaluation ^y'V'j^F{R)\ji=ji^ because F{R) does not really form a barrier at 
R = Rt- Thus, we assume that the curvature at the barrier top is ~ 1/Rt using dimensional 
analysis. We express lengths in terms of the persistence length = Iq. Setting ^'^^"^^^ = " 
-j^ = I, and = X the radial probability density is 



,(0) 



I 

-2 



V ' ' ^ (1 - 2f)9/2 4(s//)(l - 



with dxP{x, s,l) = 1. When the dimensionless contact radius Xt = a{= a/lf^) <^ 1, the 



exponential factor can be approximated as exp i—AF^ IkBT) = P(a, s, l)/P(xb, s, /) ~ "^^'''''.l . 

The function P{r) fEq. ljllj) ) is not appropriate for estimating the contact probability of 
semiflexible chains even though the overall shape of the mean field distribution function is in 
excellent agreement with the simulations and experiment. The contact probability for DNA is 
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well studied by Shimada and Yamakawa,^^ thus we use their result for G{0,1). If x = and 
s < 10 then the Shimada- Yamakawa equation gives a reliable estimate of the looping probability 
{G{0,s,l)=Go{s)). 

896 32 

Go(s) = — exp (-14.054/s + 0.246s) (16) 

At a large s{> 10) value an interpolation formula Go{s) ~ s^^/^ due to Ringrose et. alM 
can be used. Note that as the chain gets stiffer (/ decrease) Gq{s) decreases substantially 
indicating a great reduction in the loop formation probability for intrinsically stiff chains. It 
should be stressed that Eq. (fT^ has been obtained only for cyclization process and does not take 
into account the effect of dangling ends (Fig^(b)). The contact probability (r 0) between 
interior segments should be different from the one for the end-to-end contact. In other words 
G{r 0, s,/) should depend on s/l. Unfortunately, we do not know of analytical results for 
G(0,s, Z). We simply use Gq{s) for (7(0, s,Z) and resort to the values of and G{xb,s,l) to 
account for end effects. We validate the approximation that (7(0, s, /) does not depend on I 
{Gq{s) = G{0,s,l)) explicitly using simulations (see below). 

With these approximations the loop formation time in the presence of dangling ends (Fig^ 
(b)) is 

r.(., /) ^ {Ug{x„ s, l)/{% - ^]^^)^/n^^. (17) 
4 XI G{Xb,s,l) aV Go{s) 



where x = Xb = y ''"'"^^ ^^^ -sl and ^ is the adjustable parameter we introduced to account for 
the uncertainty in computing the curvature at the barrier top, i.e., \/'VlF {x)\x=xt=a = ^^^bT. 
Note that the structure of Eq. ()17|) is identical to our previous estimate of tertiary contact 
formation time used to interpret kinetics of loop formation in proteins T{n) ^ D^pln) where n is 
the loop length, (i?^) is the mean square distance between the two residues, Dq is an effective 
monomer diffusion constant, and P{n) is the loop formation probability .^i^^i^^ The differences 
between the two lie in the numerical prefactor inside {...}. In addition, in the dimensional 
argument used to obtain riji) we used (i?^) instead of 1^ that arises in the present theory. 

FiglSl shows that the estimates of looping time using Eq.(^7j) and the results of simulations 
for the same set of parameters are in excellent agreement when ^ 7.3 (see the next section 
for details of the simulations). First, t^. increases and converges to the finite value with the 
increasing size of end tails (decreasing s/l) and this trend manifests itself as the chain gets 
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stiff er and shorter (small s) (see Refs.^^ and^S). The inset shows that, at s = 3, Tc increases by 
a factor of ~ 1.5 when the total contour length of the dangling end is 5 times longer than the 
contour length of the loop. Second, Tc is a minimum (r™") when the contour length between 
loop formation sites, |si — S2I, is around (3 ~ 4) l^^ and r™*" shifts towards the large s value 
with the increasing size of dangling ends. Note that when the loop size becomes large (s > 6) 
Tc does not depend on the length of the dangling ends. In non-interacting Gaussian chains the 
equilibrium distribution of any two segment along the chain is always Gaussian. In this case, 
the presence of the dangling ends does not affect the chain statistics. 



V. SIMULATION OF LOOP CLOSURE DYNAMICS 

To check the validity of the theoretical estimates for loop closure time we performed simu- 
lations using a coarse-grained model for ds-DNA. The simulation procedure is identical to the 
one used by Podtelezhnikov and Vologodskii (see details in Ref.'^). Because the time scale of 
~ ms is computationally difficult to accomplish, even using Brownian dynamics (BD), we use 
a coarse-grained model of ds-DNA by choosing the pitch of the helix (10 base pairs with the 
diameter Iq = 3.18nm) as a building block of a ds-DNA chain. The energy for a worm-like chain, 
that is appropriate for ds-DNA, is taken to be the sum of the bending rigidity (Eb) term and 
the chain connectivity (Eg) term, which respectively are given by 

N-l 

Eb = aRT J2 (18) 

i=l 

and 

Es = ^f2(^^-ior. (19) 
i=i 

where T is the temperature, R is the gas constant, 6i is the i-th bond angle, k is the i-th bond 
length. By choosing the parameters a = 7.775 and /5 = 50 one can get the typical persistence 
length of 50nm for ds-DNA. 

Despite the simplification in the energy function computation of the looping time through 
direct BD simulation still is prohibitively difficult. From Eq. (fT7j) it is clear that the loop forma- 
tion time can be even up to 0{1) sec for certain values of Ip and L. To overcome this problem. 
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Podtelezhnikov and Vologodskii used the relation between the equihbrium probabihty of loop 
formation and the loop closure and opening times, 

Piro;\s2-si\,L)= ^° . (20) 
Tic + To 

To is the loop opening time of the closed loop. In general Tq ^ tjc- This observation enables 
us to perform direct BD simulation for the loop dissociation rather than loop closure. Since 
-^('"o; 1^2 ~ L) is normally very small for small tq [vq = 5nm), there is a sampling problem. 
However, P(ro; |s2 — Si|, L) can be found using the Markov relation 

n-l 

P(ro;|s2-si|,L) = J]P(r,|ri+i) (21) 

i=l 

where P(rj|rj+i) is the conditional probability that conformations with r < rj in the subset of 
conformations with r < rj+i. To obtain P{ri\riJ^i) we performed {n— l)-Monte Carlo samplings 
using the pivot algorithnt^ by iteratively adjusting the interval of the end-to-end (or interior- 
to-interior) distance Tq < ri < • ■ • < r„ such that P(rj|rj+i) ~ 0.2. 

The results of our simulations for P(ro; L) and Tq for both end-to-end contact and contact 
between segments are shown in FigEJ Note that there is minor difference between the contact 
probabilities of the end-to-end and of the interior-to-interior {\s2 — Si\/L = 0.5) segments 
whereas the loop opening dynamics for the chain with dangling ends is slower than the case 
without dangling ends by about ~50%. The independence of P(ro; \s2 — Si\,L) justifies the 
approximation, Gq{s) G{0,s,l), used in obtaining Eq. (fT7|) . The values of and tjc can 
be computed knowing P(ro; \s2 — si\,L) and Tq. The results, which are shown in FiglHl are in 
excellent agreement with theory. 



VI. INTERIOR LOOPING DYNAMICS IN SEMIFLEXIBLE CHAIN WITH 

VARIABLE PERSISTENCE LENGTH 

In many cases stiffness of the loop, which is involved in interior looping, is different from 
the overall persistence length of the chain. A simple example is the formation of a /5-hairpin 
in peptides. In this case, the stiffness of the loop is typically less than the /^-strands. If the 
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/3-hairpin-forming polypeptide chain is treated as a WLC then it is characterized by three 
fragments, namely, the loop with a persistence length lp2 and the strands whose persistence 
lengths are Ipi and Ips. Such a variable persistence length WLC is also realized in the DNA- 
RNA-DNA construct in which lp2 ~ 10 A and Ipi ~ Ips ~ 500 A. Constructs consisting of three 
WLC fragments are also used routinely in laser optical tweezer experiments. 

The Kramers type theory, used to calculate interior looping in WLC with uniform Ip, can 
be adopted to compute tjc in WLC with variable Ip. The mean field equilibrium distribution 
function P(r) (with r = Ri2/\s2 — Si\), which is needed to calculate tjc is (see Appendix III for 
the derivation) 

where L2 = |s2 — Si| is the contour length of the loop part of WLC with persistence length lp2, 
L is the total contour length of the chain, t = L/fJ^. The effective persistence length of the 
WLC, consisting of three segments < s < si with Zpi, si < s < S2 with /p2, and S2 < s < L 
with /p3 is 

1 

In the mean field approximation, the WLC in which Ip varies along the contour in a discrete 
manner, is equivalent to a WLC with an effective persistence length. It follow from EqOHlthat 
the effective persistence length is determined by the smallest Ip^. 

Consider the simplest case L2 = L^nt, Li = L3 = kzLim^ _ _ iH ^ ^^^^ _ ^^o)_ j^^ ^j^jg 
case, 

l^ff 1 

' - (24) 




(23) 




1 X]\ Jn\ + X 



/(O) 

Lp 

where x = Lint/L. It follows from EqI211 that if lp > l^^ (handle is stiffer than the loop) 
then Ip-^'^ > Because in the Kramers description tjc is controlled by Ip-^^ we expect that 
interior looping time is greater than tjq for a chain with uniform lp. In the opposite limit, 
lp < l^\ (handle is softer than the loop) /^-^-^ < l^\ Consequently, attaching a soft handles 
should enhance the rate of interior looping. 

The interior looping kinetics for a WLC copolymer for different values of the loop and handle 
persistence lengths are shown in Figl^l In accord with the arguments given above, we find 
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that when /l^p^ = 2 (stiff handles) Tic increases substantially compared to tic{= Tjq) for 
Ip /l^^ = 1 for all values of |s2 — Si\/l^p^ (FigEl)- Similarly, when the handle is softer than that 
of the loop, Tjc decreases appreciably compared to r^^ which is the interior looping the case 
when the chain has uniform stiffness. In the interesting regime of |s2 — si\/l^p^ ~ (2 — 4) we 
predict a dramatic increase in tjc compared to r/^ when Ip /l^^ > 1 and a substantial decrease 
in Tjc/rffj when Ip /if^ < 1 (see inset in Fig|H)). Thus, stiff handles retard interior loop kinetics 
whereas soft handle enhances rates of interior loop formation. 



VII. EFFECT OF MONOMER-MONOMER INTERACTION ON LOOP CLOSURE 

TIMES 

Majority of the recent experiments on dynamics of loop formation have been analyzed using 
simple polymer models that do not explicitly consider interaction between monomer segments. 
In a number of cases there are physical interactions between monomers. For instance, DNA 
is charged and the interaction between the monomers can be approximately described using 
the short ranged Debye-Hiickel potential. Similarly, solvent-mediated interactions also arise 
especially when considering proteins. For these reasons it becomes necessary to consider an 
interplay between chain stiffness, entropic fluctuations of the polymer and nonlocal interaction 
between monomer segments. 

The non-linear problem, that arises from the constraint v?{s) = 1, in a non-interacting 
semiflexible chain is further exacerbated when interactions between monomers are taken into 
account.— To circumvent this problem we assume that the effect of intra-chain interaction is to 
only alter the effective persistence length. We compute the loop closure kinetics using Eq. lfTTj) 
with a renormalized persistence length that explicitly depends on the nature of the interac- 
tion between the monomers. This approximation is in the same spirit as the local equilibrium 
assumption used in this study. 

To calculate the renormalized persistence length Ip we follow the procedure due to Hansen 
and Podgornik^ who used a mean- field weight (similar to \I'A/i?(u(s)) in the presence of non-local 
interaction, V{y{s) — r(s')) between monomers s and s' . The standard field-theoretic procedure 
is to use the Hubbard- Stratonovich transformation via auxiliary fields to eliminate the non- 
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= / dzz'e-^^/^ ( = + =57^ ) (26) 



Markovian nature of V{r{s) — r(s')). Using stationary phase approximation to evaluate the 
optimal values of the auxiliary fields they^ obtained an expression for l^^^ for arbitrary potential 
V{r{s) — r(s')). In our applications, we assume that the charged monomers interact via the 
screened Coulomb interaction V{r) = -^^^^^-^ where Ib is the Bjerrum length {e^ / ATreoksT = 
Ib), and A is the effective separation between charges on the monomer. The use of Debye-Hiickel 
potential is appropriate when considering ds-DNA in monovalent (Na"*") counterions. With this 
choice of V{r) the renormalized persistence length becomes = + and is given by^ 

= /f +Fi(/f )2/(L//f ,0 (25) 

with 

where Fi = lB/{l2V2dA'^), B{z) = z-l + e-^ and ^ = 1/{V2kII^^). The integral I^L/lf^ ,1) 
has different asymptotic behavior depending on the two parameters kI^^^ and L/lp^\ (i) If 
Ip^^ > (^ < 1), i.e., the persistence length is greater than the screening length, then the 
contribution due to electrostatic interaction can be treated perturbatively. In this case the 
upper limit of the integral in Eq. ()26p is effectively set to infinity. We find B{z) ^ z"^ and 
I{L/ll^\^) ~ which is also small since ^ < 1. Therefore, Slp^^ ~ ^2^2 ; which coincides with 
the OSF result . ^^1^^ For electrostatic contribution to persistence length of a polyelectrolyte chain 
the limit l^,^^ > is most appropriate for DNA . (ii) if l^^^ < k^^ (,^ > 1), i.e., the persistence 
length is smaller than the screening length, there is substantial interaction between the chain 
segments beyond the length scale of /p . We believe this situation is difficult to be realized in 
experiments involving biopolymers. In this case, the integral up to 2; = becomes important. 







42- 

{R) 



B{z) ~ z and I{L/lf\C) ~ ^ • Therefore, 51);'' ~ 



To calculate loop closure times the renormalized persistence length Ip is numerically com- 
puted for each parameter set (contour length L, inverse screening length and we use 
(Eq.(^7j)) with if'^ lp^\ For the ds-DNA in the monovalent salt solution (concentration c) 



the parameters in the semiflexible chain model are Ib = 7.lA, A = 1.7A and k = a/SttIbc. The 
results for and tjc are plotted in FigsIHl 13 First, the cyclization times are computed as a 
function of L at various salt concentrations (FigEj). We find that Tc shows a dramatic increase as 
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c is varied at small values of L/lp . The electrostatic repulsion retards loop closure times, as the 
salt concentrations (strong inter-segment repulsion) decrease (FigEJ. Because of the interplay 
between bending rigidity and chain entropy Tc has a minimum at t = t*. The value of t* shifts 
from t* = 3 to t* = 6 as c decreases. The inset in FiglHl shows that there is practically no change 
in Tc at t = 3 if c > 100 mM, which is near the physiological concentration (150 mM Na^). 
In this range of c the electrostatic contribution to the persistence length is small so that Ip is 
almost the same as Note that the c corresponding to the condition IpH = 1 for Ip = 50 nm 
is c ^ 40 mM. 

The dependence of tjc, which examines the effect of the dangling ends, at high and low 
concentrations and various values of |s2 — on |s2 — si| is shown in Figd The insets of 

FigElshow an increased time scale at low salt concentration (10 mM) compared with high salt 
concentration (500 mM). When tjc for |s2 — Si|/L = 0.2 is compared with |s2 — = 1, Tc 

increases by a factor of ~ 2.7 at c = 10 mM whereas the increase is about a factor of ~ 1.5 
for c = 500 mM. The effect of dangling ends on loop formation dynamics manifests itself more 
clearly at low salt concentration when electrostatic repulsion is prominent and at small ratio of 

\S2 - Si\/lp\ 

VIII. DEPENDENCE OF ON THE MONOMER-MONOMER INTERACTION 

POTENTIAL 

( 

In our theory and tjc can be determined provided Ip and the distance distribution 
functions are known. To examine the variation of Ip^^ on the nature of monomer- monomer in- 
teractions we have computed Ip for different potentials V{r). Sets of equilibrium conformations 
of 50-mer bead-spring model (see Eq. lfTKjl .lfTI Hl ) are generated with different bending rigidity, hib, 
and with different non-local potentials V{r) = l/r"(a =1, 2, 4, 6, 12). In each case the effective 
persistence length Ip^^ is computed by where (6) is the ensemble average of the angle 

formed by three consecutive beads. We show the simulated radial distribution function and the 
effective persistence lengths for different values for the various ranges of nonlocal interaction 
in Fig|Hl-(a),(b). The results from radial distribution function (FiglHl-(a)) and the persistence 
length show that Zp^^ ^ l^^ when the interaction is short-ranged, i.e., a > 4. When a < 4 
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then the effective interaction between monomers leads to an increase in the persistence length 
(FiglHJ-(b)). These results are consistent with the field theoretical approach by Hansen and 
Podgornik.— Considering that the excluded volume potential is of short range nature (modeled 
using ~ r~^^ or delta function) we conclude that the excluded volume effect on the looping 
dynamics of rigid polymer chain is negligible. Note that the screened electrostatic potential 
V{f^) = can be either a short or a long range potential depending on the value of k. 

These calculations, especially changes in Ip as the range of interaction is altered, explain 
the reason that a simple WLC model works remarkably well in a number of applications. 
For example, the response of DNA, RNA ,^^i^^ and proteinsSZ, to mechanical force has been 
routinely analyzed using WLC. Surprisingly, recent analysis of small angle x-ray scattering 
measurements^ on ribozymes have shown that the distance distribution function can be 
quantitatively fit using P{R^). In these biopolymers the interactions that determine the 
conformations are vastly different. However, the results in Fig|Hl-(b) show that as long as these 
effective interactions are short-ranged Ip should not differ from the bare persistence length. 
This key result rationalizes the use of WLC in seemingly diverse set of problems. 



IX. CONCLUSIONS 

In this paper we have used theory and explicit simulations of worm-like chains to examine 
loop formation dynamics with emphasis on kinetics of contact formation between monomers that 
are in the interior of the chain. The Kramers theory, adopted to describe looping time scales 
using the analytically computed potential of force between the contacting (or reacting) groups, 
gives results that are in quantitative agreement with simulations. The theory-'' for tjc and Tc 
contains one parameter that was introduced to account for the uncertainty in the estimate of the 
frequency at the transition state (FigI2I) in the intramolecular reaction (Fig^). The present study 
also provides a justification for the use of Kramers-like theory in describing looping dynamics 
by explicit comparison with simulations of semiflexible chains. Although several questions of 
fundamental theoretical importance remain previous studies, beginning with the pioneering work 
by WF, and the present study have given a practical analytic formula to analyze most of the 
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recent experimental data on proteins and DNA. We conclude the paper with a few additional 
comments. 

1. The present work and several previous studies , ^^i^'^'i^'^'i^^ which have examined the effect 
of stiffness on looping dynamics, have shown that the rates of cyclization and interior 
looping must slow down as the loop length becomes small. In other words, there must 
be a turnover in the plot of ka {a = c or IC) as s decreases (see FigO] in which tjc as 
a function of s is shown). For the parameters used in Fig|21the turnover occurs around 
s ~ (3 — 4). Such a crossover has been observed in the cyclization kinetics of DNA and in 
simulations of worm like chain models. When s is small the time scales for loop formation 
can be substantially large (~ 0{1) sec). 

The effect of stiffness on cyclization rate in disordered peptides has also been emphasized.— 
For the construct Cys — {Ala — Gly — Gln)j — Trp with j from 1 — 6 the stiffness effects 
are evident at j ~ 3.^^ However, these authors did not observe the theoretically predicted 
turnover in this construct for which the persistence length is estimated to be Ip ~ 0.7nm. 
Using the results in Fig. (3) we predict that the turnover must occur only when the 
number of peptide bonds is less than about 3. This limit has not been reached in the 
experiments by Lapidus et. a/.— For the construct {Gly — Ser)j Hudgins et. al.^'^ have 
clearly observed a turnover when the number of peptide decreases below about 4. The 
observation of Hudgins et. al. is consistent with our prediction that turnover in cyclization 
rates in disordered polypeptides occurs when s < 3. When the number of residues in the 
polypeptides chain becomes too small then measuring Tc using bulky donor-acceptor pairs 
in FRET experiments is difficult. In this situation other methods^^ could be used. 

2. For the parameters used in Fig. (3) the difference between and tic is no more than about 
a factor of four. However, if charged interactions between monomers become relevant then 
Tic can be very different from r^. At both low and high values of the salt concentration 
the Tc and tjc can differ by nearly an order of magnitude (see Fig. (6)). These variations 
are significant because Tc in polypeptide chains studied thus far varies by less than a factor 
of ten as the number of residues is varied from 5 to 20. It would be interesting to probe 
looping dynamics by varying the net charge on polypeptides. We should also stress that as 
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the salt concentration increases the electrostatic interactions in a high dielectric medium 
are effectively short ranged. In this case tjc is determined essentially by the bending 
rigidity of the backbone (FiglHI). 

3. The Kramers based theory for Tc and tjc is a convenient way to measure persistence length 
of polypeptide chains as a function of temperature and denaturant concentration. Recent 
measurements suggest that Ip depends on urea concentration.^^ More importantly, there 
appears to be strong sequence effects in which, at the level of polymer-based theories, 
must reflect changes in Ip. For example, Tc for polyproline deviates substantially from ideal 
chain behavior.^^ Similar measurements of Tc for other polypeptides along with the simple 
theory can be used to extract how Ip varies with sequence. 

4. The dependence of interior looping time on the ratio of the persistence lengths of loop 
and the handle shows that in the interesting range of s ~ (2 — 4) tjq can be substantially 
larger than Tjq for a chain in which Ip /l^^ = 1 (see inset in FigEl). This case is directly 
applicable to /3-hairpin formation that is controlled by formation of a loop with persistence 
length that is less than that of the strands.— From Figl^l it follows that as the stiffness 
of the loop increases the interior looping time also increases. This conclusion is in accord 
with explicit simulations of coarse-grained models of /?-hairpin formation that showed that 
enhancement of loop stiffness retards rate of /3-hairpin formation.— 

APPENDIX A 

In this Appendix we outline the steps leading to Eq.®. The distribution function G{Ri2, Si — 
S2) is 



duodus^ dus2 dui 





X e 4^ 



2A'"'^ + 2A' 




(Al) 
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Using the expression Eq.(@]) for Z{us, uq, s) and carrying out the integrals over the u variables 
Eq. ()A.l|) becomes 

AL+2(5 /^^ '^^^ /7rsinh^7(s2 — '^^^ /vr sinh — S2) ^ ^^'^^ 

G(iti2, si — S2) = e 



d^k / k^ 

■exp I ik ■ R12 - 7-r|s2 - Si| + — ■fip(cothfi(s2 - Si) - 



(27r)3^^^V .X, , 2A2^^PV^-^-^^v^- sinhfi(s2-si: 

TT \3/2 /l \ 

^ exp ^b^(k)^-ib(k) (A.2) 



AetAJ V4 

where 

/ 



A 



s + n,cothns^ 

r!p(cothf^si+cothr!(s2-si)) - .inho?r,-s,) 



- sinhOU;-..) 17p(c0thf}(s2-,Si)+COthf7(L-,S2)) 



- sinhn(L-«.) ,5 + 17pCOthf](L-52)y 

(A.3) 

and 

ik_ . . 1 

sinhr2(s2 — Si] 

The integration with respect to k leads to 

'7rsinhi7si\ '^^^ /tt sinh fi(s2 — Si) \ /vr sinhf2(L — 32)^ 



= yfip(cothr](s2 - Si) - g-^^^/^ _^ J (0110). (A.4) 



G{Ru,Si,S2) = e 



XL+25 



TT \3/2 1 / 47rA2 ^^/^ 



Vdet^y (27r)3 Vg(si,S2;A,(5).' ^ ^ 



where 



^2 g- 

g(si, S2; A, 5) = |S2 - si|A - 2fip(coth fi(s2 - si) - ^.^^^^^^ _ ^^^ ) + M(si, S2), (A.7) 

and 

O sinh tanh^ ''^^^ 



M(si,S2;A,(5) 



25fip cosh fiL + (52 + (]2) sinh (]L 



X {4*n, cosh "'^^ - - + 2(i'^ + ) smh "'^^ " '^^ ' 



"P— 2 i -V" i -"p7 - — 2 

(5^ - (sinh ^('"^ - - '^^^ - sinh ^^^^-^^-^^-)^ I ( A.8) 
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In limit L S2 ^ Si ^ 1, M{si,S2) — * flp and Q{si, S2] X,S) — * |s2 — si|A — Qp. As a 
result of translational symmetry along the chain G{R, Si, S2) = G{R, \s2 — Si\). Using JF(A, 6) ~ 
— In G'(i?i2, si, S2) leads to Eq.®. 



APPENDIX B 



In obtaining the stationarity condition to evaluate A and 5 we first took the thermodynamic 
limit {L —>■ 00) and then calculated the optimal values of A and 6. It is technically necessary to 
solve the stationarity condition ^ = ^ = before taking the L ^ 00 limit. In this appendix 
we examine the consequence of taking the thermodynamic limit after solving for optimal values 
of A and S. For simplicity, set |s2 — Si| = L. The variational equations for A and 6 become 



and 



5 + a 



L2 ' - 



3 
2 



9 



2fi, 



(B.l) 



(B.2) 



From the second relation we find two roots for Qp, namely, 

3(1 + 1) ■ 



A±L 



4(1 




12(1 -r2) 
W 



t(l 



(B.3) 



where t = L/Iq, /q = and r = R/L. There are no restrictions on the values of L and Ip in 
Eq.((Z|) in which the thermodynamic limit is taken first. However, when the order of operation 
is exchanged there is a possibility that the two roots are that can be imaginary. For we 
retrieve the same stationary phase condition as Eq.(jHl) only if L ^ lp{t ^ 1) The second root 
Qp = but this can be discarded since A 7^ 0. Although there are multiple saddle points, we 
can always deform the contour such that the contour passes the saddle point with J-''{Q^) = 
which satisfies the stationarity condition. In addition, ^^(10 — t — j) < < 1 should be always 
satisfied for Qp to be real (Eq. ()B.3j) ). 
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APPENDIX C 



We calculate P(Ri2, |s2 — for a semiflexible chain in which the persistence length of the 
loop is different from that of the dangling ends. The chain can be thought of as a triblock 
WLC copolymer which is an appropriate model for RNA hairpins or /3-hairpins in disordered 
polypeptides. The interior R12 distribution function is 

G(Ri2;Ivi,L2,L3) = (5(Ri2 - / u{s)ds))MF 

_ /D[ll(s)]^(Ri2 - ^ uis)ds)^MFHs)] 

jD[uis)]^MFHs)] ' ^ ■ ^ 

where u(s) is a tangent vector at position s. Suppose the chain consists of three different parts 
characterized by the persistence lengths (Ipi) and the contour lengths (Lj) with i = 1,2,3 {L = 
^^^^ Li). The exact weight *[u(s)] oc exp [- ^^^^ ^ /^^ ds (g)^] H S{u\s) - 1) is replaced by 
the mean field weight \E'A/i7'[u(s)]. 



i=\ ^ Li \ / 1=1 ^ Li 



(C.2) 

As in Appendix A, the Lagrange multipliers A and 6 are used to enforce the local constraint 
u^(s) = 1. Following, exactly the procedure outlined in Appendix A we find that in the L 00 
limit, the analogue of Eq|7| becomes. 



J^[X,6] ^ -(LA + 25) 



i=l 

L 



+ - In [{S + + fip2)(f^p2 + ^p3)i5 + a 



p3) 



3 , L2 Rl2'^ 
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p3 



L2 Rx2 ■ 



+ 1 



+ 1 



- lnL2 
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(C.3) 



where f2j = a/2A/7^ and ilpi = Vlilpi/2. The major contribution to the integral comes from the 
sets of A and 5 which pass the saddle point of a stationary phase contour on Re{jF} plane. The 
stationary condition for A by taking derivative with respect to A by retaining the leading term 
in L, which leads to 

3 



4(1 




(C.4) 
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where r = Similarly, the condition for 5 results 



1 1 

+ 



p3 



(C.5) 



Substituting Eq iC.4l and lC.5l to G{R) ~ exp (— JF[R; A, 6]) gives the desired distribution function, 



G{Ri2] {Ipi}, {Li}) 



N 



(1 - :^r2)9/2 



exp 



El 1 Li 



(C.6) 



The effective persistence length in the mean field approximation is 



jeff 



L, 1 



,=1 L 



(C.7) 



The result derived for a triblock WLC can be generalized into A^-block WLC with persistence 
lengths {Ipi} and contour lengths {Li}. The distribution function is 



P r) 



exp 



3t 1 



{CI 



where t = L/l^ff, l^ff = | Ef=i 



L, 1 



r = -R12/IS2 — si| and C is a normalization 



constant. 
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FIGURE CAPTIOINS 

Figure^: Loop formation in semiflexible polymer chains: (a) Cyclization event, (b) Interior 
contact (IC) formation between monomers Si and S2 in the chain interior. The segment length 
Si and L — S2 are referred to either as dangling ends or handles in the text. 

Figure El : Comparison between the interior distance distribution functions for different 
size of the loops {^-^^jp^ =1, 0.8, 0.5). The value of \s2 — Si\/L = 1 corresponds to end-to- 
end distribution function. The flexible chain limit {t = 10) is on the left and the right panel 
is for stiff chains {t = 2). The panel on the right shows a sketch of the effective potential 
F{r) = —kBT\ogP{r) for the case |s2 — Si\/L = 0.8. The wells at Rb and the barrier top Rt 
are highlighted. 

Figure 01 : Plots of loop formation time {tjc) obtained using Eq.()17p as a function of the 
distance between site si and S2 expressed in terms of l^^ for various size of dangling ends 
expressed by s/l{= \si — S2\/L). Here / = L/l^^ and s = |s2 — si\/l^\ The specific values for 
parameters are l^^ = 50 nm, D = 2Dq = 1.54 x 10~^^m^/s and a = 0.1. The insets are for the 
loop closing time at \si — S2\/l^^ = 3 as a function of s/l. For two sites separated by — S2I along 
the chain the longer loop closing time is expected if the sites of interest are connected by long 
dangling ends. If the separation is much larger than the persistence length — S2I >> l^^) the 
effect of dangling end on loop closing time vanishes. The results of simulations for the same set 
of parameters are shown in symbols. The excellent agreement between theory and simulations 
validates the assumptions leading to Eq. lfTTj) . 

Figure HI: Plot of looping probability, P(ro; S), (left) and dissociation time, Tq, (right) as a 
function of interior loop contour length 5*. The capture radius ro = 5nm. The simulations are 
performed for both end-to-end {S/L = 1) (black circle) and interior-to-interior {S/L = 0.5) (red 
triangle). The parameters of the semiflexible chains are the same as in FigOl Note the loop 
dissociation time is much shorter than tjc- 

Figure El: Interior looping time {tjc) as a function of the reduced distance between sites Si 
and S2 for the dangling ends of s/l = 0.3 for WLC with variable persistence lengths {Ip /Ip'^ = 2 
in green, triangle up, Ip /l^p^ = 1 in red thick line, and Ip /if^ = 0.5 in blue, triangle down). The 
physical situation corresponds to FigHlin which the persistence length of the two handles (0, si) 
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and {s2,L) is 1^ and the contour lengths of three segments are identical. For comparison the 
cyclization time (r^ with s/l = 1 and the chain persistence l^^) as a function of s{= I = L/l^^) 
is shown in black dashed line. The inset shows the ratios of looping times {tjc) for the chain 
with variable persistence length with respect to the looping time (t/^) of the chain with uniform 
persistence. The up triangle in green is for the ratio between Ip /if^ = 2, s/l = 0.3 and 
Ip /if^ = 1, s/l = 0.3 (stiff handle), and the dpwm triangle down in blue is for the ratio between 
= 0.5, s/l = 0.3 and = 1, s/l = 0.3 (soft handle). 

Figure IHl : Plots of cyclization time ( 'Tq ^ clS db function of L (expressed in terms of l^p^) for 
various salt concentrations. The same parameters {l^\ D, a) with those in Figl^lare used. The 
inset shows Tc at L/lf^ = 3 as a function of c. The cyclization time Tc increases sharply below 
c < 50mM. 

Figure I?] : Interior looping time {tic) as a function of the reduced distance between site 
Si and S2 for various size of dangling ends under two salt concentrations. The length of the 
dangling end is given by the parameter s/l{= \si — S2\/ L). The same parameters {l^p \ -D, a) as 
in Fig|21are used. The values of the salt concentration (lOmM, 500mM) are explicitly shown. At 
each value of c the different curves correspond to distinct values of s/l. The values of s/l range 
from 0.2 to 1. The inset shows tjc, at the two values of c, as a function of s/l for \s2 — S2\/l^p^ = 3 
(the vertical dashed line). 

Figure |H1 : (a) End-to-end radial distribution function for a semiflexible chain with bending 
rigidity = 3, 10, 20, 50, and 100 in unit of ksT) for various non-local interaction potentials 
between monomers. The form of the potential is V{r) = r""". with a = 1,2,4,6,12: Results 
are obtained using Brownian dynamics simulation using the energy function Ewlc = Eb + Es + 
^i<j '^^^ effective persistence length for various potentials V{r) at different values 

of the bare bending rigidities. 




(b) 
FIG. 1 



(a) t(=|S2-sJ/lo)=10 (b) t(=|S2-sJ/lo)=2 
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s{=\s^-s,\ir"') 
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